40 research outputs found

    Data pre-processing and data generation in the student flow case study

    Get PDF
    Education covers a range of sectors from kindergarten to higher education. In the education system, each grade has three possible outcomes: dropout, retention and pass to the next grade. In this work, we study the data from the Department of Statistics of Education and Science (DGEEC) of the Education Ministry. DGEEC maintains those outcomes for each school year, therefore, this study seeks a longitudinal view based on student flow. The document reports the data pre-processing, a stochastic model based on the pre-processed data and a data generation process that uses the previous model.The authors would like to thank the FCT Projects of Scientific Research and Technological Development in Data Science and Artificial Intelligence in Public Administration, 2018-2022 (DSAIPA/DS/0039/2018), for its support. LCav, PP and LCor also acknowledge support by UID/MULTI/04046/2103 center grant from FCT, Portugal (to BioISI).info:eu-repo/semantics/publishedVersio

    Quantum walks can find a marked element on any graph

    Full text link
    We solve an open problem by constructing quantum walks that not only detect but also find marked vertices in a graph. In the case when the marked set MM consists of a single vertex, the number of steps of the quantum walk is quadratically smaller than the classical hitting time HT(P,M)HT(P,M) of any reversible random walk PP on the graph. In the case of multiple marked elements, the number of steps is given in terms of a related quantity HT+(P,M)HT^+(\mathit{P,M}) which we call extended hitting time. Our approach is new, simpler and more general than previous ones. We introduce a notion of interpolation between the random walk PP and the absorbing walk P′P', whose marked states are absorbing. Then our quantum walk is simply the quantum analogue of this interpolation. Contrary to previous approaches, our results remain valid when the random walk PP is not state-transitive. We also provide algorithms in the cases when only approximations or bounds on parameters pMp_M (the probability of picking a marked vertex from the stationary distribution) and HT+(P,M)HT^+(\mathit{P,M}) are known.Comment: 50 page

    Validation of Results from Knowledge Discovery: Mass Density as a Predictor of Breast Cancer

    Get PDF
    The purpose of our study is to identify and quantify the association between high breast mass density and breast malignancy using inductive logic programming (ILP) and conditional probabilities, and validate this association in an independent dataset. We ran our ILP algorithm on 62,219 mammographic abnormalities. We set the Aleph ILP system to generate 10,000 rules per malignant finding with a recall >5% and precision >25%. Aleph reported the best rule for each malignant finding. A total of 80 unique rules were learned. A radiologist reviewed all rules and identified potentially interesting rules. High breast mass density appeared in 24% of the learned rules. We confirmed each interesting rule by calculating the probability of malignancy given each mammographic descriptor. High mass density was the fifth highest ranked predictor. To validate the association between mass density and malignancy in an independent dataset, we collected data from 180 consecutive breast biopsies performed between 2005 and 2007. We created a logistic model with benign or malignant outcome as the dependent variable while controlling for potentially confounding factors. We calculated odds ratios based on dichomotized variables. In our logistic regression model, the independent predictors high breast mass density (OR 6.6, CI 2.5–17.6), irregular mass shape (OR 10.0, CI 3.4–29.5), spiculated mass margin (OR 20.4, CI 1.9–222.8), and subject age (β = 0.09, p < 0.0001) significantly predicted malignancy. Both ILP and conditional probabilities show that high breast mass density is an important adjunct predictor of malignancy, and this association is confirmed in an independent data set of prospectively collected mammographic findings

    Hierarchy measure for complex networks

    Get PDF
    Nature, technology and society are full of complexity arising from the intricate web of the interactions among the units of the related systems (e.g., proteins, computers, people). Consequently, one of the most successful recent approaches to capturing the fundamental features of the structure and dynamics of complex systems has been the investigation of the networks associated with the above units (nodes) together with their relations (edges). Most complex systems have an inherently hierarchical organization and, correspondingly, the networks behind them also exhibit hierarchical features. Indeed, several papers have been devoted to describing this essential aspect of networks, however, without resulting in a widely accepted, converging concept concerning the quantitative characterization of the level of their hierarchy. Here we develop an approach and propose a quantity (measure) which is simple enough to be widely applicable, reveals a number of universal features of the organization of real-world networks and, as we demonstrate, is capable of capturing the essential features of the structure and the degree of hierarchy in a complex network. The measure we introduce is based on a generalization of the m-reach centrality, which we first extend to directed/partially directed graphs. Then, we define the global reaching centrality (GRC), which is the difference between the maximum and the average value of the generalized reach centralities over the network. We investigate the behavior of the GRC considering both a synthetic model with an adjustable level of hierarchy and real networks. Results for real networks show that our hierarchy measure is related to the controllability of the given system. We also propose a visualization procedure for large complex networks that can be used to obtain an overall qualitative picture about the nature of their hierarchical structure.Comment: 29 pages, 9 figures, 4 table

    Statistical Modeling of Single Target Cell Encapsulation

    Get PDF
    High throughput drop-on-demand systems for separation and encapsulation of individual target cells from heterogeneous mixtures of multiple cell types is an emerging method in biotechnology that has broad applications in tissue engineering and regenerative medicine, genomics, and cryobiology. However, cell encapsulation in droplets is a random process that is hard to control. Statistical models can provide an understanding of the underlying processes and estimation of the relevant parameters, and enable reliable and repeatable control over the encapsulation of cells in droplets during the isolation process with high confidence level. We have modeled and experimentally verified a microdroplet-based cell encapsulation process for various combinations of cell loading and target cell concentrations. Here, we explain theoretically and validate experimentally a model to isolate and pattern single target cells from heterogeneous mixtures without using complex peripheral systems.Wallace H. Coulter Foundation (Young Investigator in Bioengineering Award)National Institutes of Health (U.S.) (Grant R01AI081534)National Institutes of Health (U.S.) (Grant R21AI087107

    'I believe that the staff have reduced their closeness to patients': an exploratory study on the impact of HIV/AIDS on staff in four rural hospitals in Uganda

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Staff shortages could harm the provision and quality of health care in Uganda, so staff retention and motivation are crucial. Understanding the impact of HIV/AIDS on staff contributes to designing appropriate retention and motivation strategies. This research aimed 'to identify the influence of HIV/AIDS on staff working in general hospitals at district level in rural areas and to explore support required and offered to deal with HIV/AIDS in the workplace'. Its results were to inform strategies to mitigate the impact of HIV/AIDS on hospital staff.</p> <p>Methods</p> <p>A cross-sectional study with qualitative and quantitative components was implemented during two weeks in September 2005. Data were collected in two government and two faith-based private not-for-profit hospitals purposively selected in rural districts in Uganda's Central Region. Researchers interviewed 237 people using a structured questionnaire and held four focus group discussions and 44 in-depth interviews.</p> <p>Results</p> <p>HIV/AIDS places both physical and, to some extent, emotional demands on health workers. Eighty-six per cent of respondents reported an increased workload, with 48 per cent regularly working overtime, while 83 per cent feared infection at work, and 36 per cent reported suffering an injury in the previous year. HIV-positive staff remained in hiding, and most staff did not want to get tested as they feared stigmatization. Organizational responses were implemented haphazardly and were limited to providing protective materials and the HIV/AIDS-related services offered to patients. Although most staff felt motivated to work, not being motivated was associated with a lack of daily supervision, a lack of awareness on the availability of HIV/AIDS counselling, using antiretrovirals and working overtime. The specific hospital context influenced staff perceptions and experiences.</p> <p>Conclusion</p> <p>HIV/AIDS is a crucially important contextual factor, impacting on working conditions in various ways. Therefore, organizational responses should be integrated into responses to other problematic working conditions and adapted to the local context. Opportunities already exist, such as better use of supervision, educational sessions and staff meetings. However, exchanges on interventions to improve staff motivation and address HIV/AIDS in the health sector are urgently required, including information on results and details of the context and implementation process.</p

    HIV and Hepatitis B and C incidence rates in US correctional populations and high risk groups: a systematic review and meta-analysis

    Get PDF

    Steering Evolution with Sequential Therapy to Prevent the Emergence of Bacterial Antibiotic Resistance

    Get PDF
    The increasing rate of antibiotic resistance and slowing discovery of novel antibiotic treatments presents a growing threat to public health. Here, we consider a simple model of evolution in asexually reproducing populations which considers adaptation as a biased random walk on a fitness landscape. This model associates the global properties of the fitness landscape with the algebraic properties of a Markov chain transition matrix and allows us to derive general results on the non-commutativity and irreversibility of natural selection as well as antibiotic cycling strategies. Using this formalism, we analyze 15 empirical fitness landscapes of E. coli under selection by different β-lactam antibiotics and demonstrate that the emergence of resistance to a given antibiotic can be either hindered or promoted by different sequences of drug application. Specifically, we demonstrate that the majority, approximately 70%, of sequential drug treatments with 2–4 drugs promote resistance to the final antibiotic. Further, we derive optimal drug application sequences with which we can probabilistically ‘steer’ the population through genotype space to avoid the emergence of resistance. This suggests a new strategy in the war against antibiotic–resistant organisms: drug sequencing to shepherd evolution through genotype space to states from which resistance cannot emerge and by which to maximize the chance of successful therapy
    corecore